Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

Neural Information Processing Systems

We study the problem of system identification and adaptive control in partially observable linear dynamical systems. Adaptive and closed-loop system identification is a challenging problem due to correlations introduced in data collection. In this paper, we present the first model estimation method with finite-time guarantees in both open-loop and closed-loop system identification. Deploying this estimation method, we propose adaptive control online learning (AdapOn), an efficient reinforcement learning algorithm that adaptively learns the system dynamics and continuously updates its controller through online learning steps. AdapOn estimates the model dynamics by occasionally solving a linear regression problem through interactions with the environment. Using policy re-parameterization and the estimated model, AdapOn constructs counterfactual loss functions to be used for updating the controller through online gradient descent. Over time, AdapOn improves its model estimates and obtains more accurate gradient updates to improve the controller. We show that AdapOn achieves a regret upper bound of $\text{polylog}\left(T\right)$ after $T$ time steps of agent-environment interaction. To the best of our knowledge, AdapOn is the first algorithm that achieves $\text{polylog}\left(T\right)$ regret in adaptive control of \textit{unknown} partially observable linear dynamical systems, a setting that includes linear quadratic Gaussian (LQG) control.
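The model-estimation step the abstract describes, solving a linear regression from interaction data, can be sketched in its simplest open-loop form: regress each output on a window of past inputs to recover the system's Markov parameters $C A^{k-1} B$. This is a generic illustration under assumed dynamics, not AdapOn itself; the system matrices, noise levels, and history length `H` below are invented for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy partially observable LDS (all matrices invented for illustration):
#   x_{t+1} = A x_t + B u_t + w_t,   y_t = C x_t + z_t
A = np.array([[0.9, 0.1],
              [0.0, 0.8]])
B = np.array([[0.0],
              [1.0]])
C = np.array([[1.0, 0.0]])

T, H = 5000, 5  # trajectory length, regression history length

# Open-loop data collection with i.i.d. Gaussian excitation inputs.
x = np.zeros(2)
U, Y = [], []
for _ in range(T):
    u = rng.standard_normal(1)
    y = C @ x + 0.1 * rng.standard_normal(1)          # observation noise z_t
    U.append(u)
    Y.append(y)
    x = A @ x + B @ u + 0.1 * rng.standard_normal(2)  # process noise w_t
U, Y = np.array(U), np.array(Y)

# Least squares: regress y_t on (u_{t-1}, ..., u_{t-H}) to estimate the
# Markov parameters G_k = C A^{k-1} B, k = 1..H.
Phi = np.array([U[t - H:t][::-1].ravel() for t in range(H, T)])
theta, *_ = np.linalg.lstsq(Phi, Y[H:].ravel(), rcond=None)

true = np.array([(C @ np.linalg.matrix_power(A, k) @ B).item() for k in range(H)])
print("estimated:", np.round(theta, 3))
print("true     :", np.round(true, 3))
```

Closed-loop identification, which the paper's method also covers, is harder precisely because feedback makes the inputs correlated with past noise, so this plain regression no longer suffices as-is.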



propose the first finite-time system identification algorithm for partially observable linear dynamical systems (LDS)

Neural Information Processing Systems

We thank the reviewers for their effort and insightful comments during these unprecedented times. LQR and LQG are among the few continuous settings where optimal policies exist (and mostly have closed form) [1]. Therefore, we do not see why this paper would be less relevant to our community. If persistence of excitation (PE) is absent, we provide two general algorithms stated in Cor. The agent uses a warm-up period of $O(\sqrt{T})$, after which it commits to a controller yielding a regret of $\sqrt{T}$.


Review for NeurIPS paper: Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

Neural Information Processing Systems

To clarify, while PE (persistence of excitation) is a common assumption in the classical control literature, it is not common in more recent non-asymptotic work. If one were to assume PE in the state feedback setting, then injecting noise would not be necessary and better regret could be achieved -- but lower bounds tell us that this is not the case. So justifying the applicability of the assumption in this output feedback setting is crucial, and I'm happy to hear that it ends up being a mild assumption satisfied by well-known controllers.


Review for NeurIPS paper: Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

Neural Information Processing Systems

In discussion, the reviewers felt that the main result of the paper---that logarithmic regret is possible for LQG under sufficient observation noise---is significant and worth pointing out, especially given $\sqrt{T}$ lower bounds for the fully observable setting. The reviewers did feel that the framing of the results can be improved, and I encourage the authors to do this for the final version. In particular, (1) the result is not necessarily surprising given the noise assumptions, and it would be good to be more transparent about this, and (2) the claim (which is even present in the rebuttal) that the exploration scheme here is "strategic" in some way compared to prior results based on injecting random noise is very questionable, and it is indeed not clear that the techniques here can be extended beyond linear control.

